Referenceless Quality Estimation for Natural Language Generation

Authors

  • Ondrej Dusek
  • Jekaterina Novikova
  • Verena Rieser
Abstract

Traditional automatic evaluation measures for natural language generation (NLG) use costly human-authored references to estimate the quality of a system output. In this paper, we propose a referenceless quality estimation (QE) approach based on recurrent neural networks, which predicts a quality score for an NLG system output by comparing it to the source meaning representation only. Our method outperforms traditional metrics and a constant baseline in most respects; we also show that synthetic data helps to increase correlation results by 21% compared to the base system. Our results are comparable to results obtained in similar QE tasks despite the more challenging setting.

Similar resources

RankME: Reliable Human Ratings for Natural Language Generation

Human evaluation for natural language generation (NLG) often suffers from inconsistent user ratings. While previous research tends to attribute this problem to individual user preferences, we show that the quality of human judgements can also be improved by experimental design. We present a novel rank-based magnitude estimation method (RankME), which combines the use of continuous scales and re...

PRF based MR-Thermometry on Abdominal Organs: A pragmatic comparison of referenceless vs multi-baseline

Introduction: Reliable temperature and thermal-dose measurements using PRF based MR-thermometry for MR-guided ablation therapy on abdominal organs are complicated by the fact that the target moves through an inhomogeneous and time-variant magnetic field. Two correction approaches emerged recently as the most promising candidates to allow continuous real-time MR-thermometry under free-breathing c...

Resource-Adaptive Model Generation as a Performance Model

Model generation calculi, close relatives of tableau calculi for theorem proving, can be used as competence models for semantic natural language understanding. Unfortunately, existing model generation calculi are not yet plausible as performance models of actual human processing, since they fail to capture computational aspects of human language processing. We outline an extended model generati...

Controlling User Perceptions of Linguistic Style: Trainable Generation of Personality Traits

Recent work in natural language generation has begun to take linguistic variation into account, developing algorithms that are capable of modifying the system’s linguistic style based either on the user’s linguistic style or other factors, such as personality or politeness. While stylistic control has traditionally relied on handcrafted rules, statistical methods are likely to be needed for gen...

Hybrid multi-baseline and referenceless PRF-shift thermometry

Introduction: Proton resonance frequency (PRF)-shift MR thermometry is a promising tool for guiding thermal therapies in the treatment of liver tumors and heart arrhythmias, but is complicated by organ motion and respiration. To address motion, multi-baseline subtraction techniques have been proposed [1,2] that use a library of pre-treatment baseline images covering the cardiac and respiratory c...

Journal:
  • CoRR

Volume: abs/1708.01759

Publication date: 2017